Vocabulary Patterns in Free-for-all Collaborative Indexing Systems
نویسندگان
چکیده
In collaborative indexing systems users generate a big amount of metadata by labelling web-based content. These labels are known as tags and form a shared vocabulary. In order to understand the characteristics of that vocabulary, we study structural patterns of these tags by implying the theory of self-organizing systems. Therefore, we utilize the graph theoretic notion to model the network of tags and their valued connections, which represent frequency rates of co-occurring tags. Empirical data is provided by the free-for-all collaborative indexing systems Delicious, Connotea and CiteULike. First, we measure the frequency distribution of co-occurring tags. Secondly, we correlate these tags towards their rank over time. Results indicate a strong relationship among a few tags as well as a notable persistence of these tags over time. Therefore, we make the educated guess that the observed collaborative indexing systems are self-organizing systems towards a shared vocabulary building. Implications on the results are the presence of semantic domains based on high frequency rates of co-occurring tags, which reflect topics of interest among the user community. When observing those semantic domains over time, that information can be used to provide a historical or trend-setting development of the community’s interests, thus enhancing collaborative indexing systems in general as well as providing a new tool to develop community-based products and services at the same time.
منابع مشابه
Tagging, Folksonomy & Co - Renaissance of Manual Indexing?
This paper gives an overview of current trends in manual indexing on the Web. Along with a general rise of user generated content there are more and more tagging systems that allow users to annotate digital resources with tags (keywords) and share their annotations with other users. Tagging is frequently seen in contrast to traditional knowledge organization systems or as something completely n...
متن کامل0 Ja n 20 07 Tagging , Folksonomy & Co - Renaissance of Manual Indexing ? ∗
This paper gives an overview of current trends in manual indexing on the Web. Along with a general rise of user generated content there are more and more tagging systems that allow users to annotate digital resources with tags (keywords) and share their annotations with other users. Tagging is frequently seen in contrast to traditional knowledge organization systems or as something completely n...
متن کاملThe Impact of Pre-Defined Terms on the Vocabulary of Collaborative Indexing Systems
Collaborative indexing systems have attracted an increasing amount of attention over the last three years. One fundamental limitation to such a system is the uncontrolled nature of its vocabulary, as this consists of terms users freely choose to index resources. As a result, the vocabulary can be poorly structured, making it difficult to harvest knowledge from the user community. Pre-defined te...
متن کاملUse of Semantic Similarity and Web Usage Mining to Alleviate the Drawbacks of User-Based Collaborative Filtering Recommender Systems
One of the most famous methods for recommendation is user-based Collaborative Filtering (CF). This system compares active user’s items rating with historical rating records of other users to find similar users and recommending items which seems interesting to these similar users and have not been rated by the active user. As a way of computing recommendations, the ultimate goal of the user-ba...
متن کاملIndexing and Retrieving Images in a Multilingual World
Introduction This communication presents the problem statement, the methodology and the preliminary results of a research project aiming to compare two different approaches for indexing images, namely: traditional image indexing with the use of controlled vocabularies, or free image indexing using uncontrolled vocabulary. The experiment intends to measure their respective performance for image ...
متن کامل